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Abstract 



[ We develop a method for the model reference adaptive control (MRAC) 

C^l ■ of LTI systems via Extremum Seeking (ES). Our proof of global asymptotic 

tracking enables design of the adaptive controller to satisfy averaging re- 
quirements, and convergence of tracking error. Our method is novel, with 
the additional advantage that no perturbations need be added to the refer- 
ence trajectory and no a priori knowledge of parameter signs is needed. We 
illustrate our results for a simulated second order system. 

Keywords: adaptive control, Lyapunov stability, extremum seeking, 
averaging 
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1. Introduction 

Many engineered systems such as robots carrying objects with unknown 
q ■ inertias, or unmanned vehicles subject to uncertain forces, are yet expected 

to guarantee performance. Dealing with such systems has motivated the 
problems of adaptive control. Adaptive control has a long history, dating 
back to the 1922 paper of Leblanc whose scheme may have been the first 
^ ; "adaptive" controller. Designing autopoilots for aircraft motivated adaptive 

control in the 1950s 2J, followed by developments of self-adjusting schemes 



such as M.I.T. rule [3| and gradient estimation |4j, |5j. Over theyears, several 
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tneyears, sev 

solutions have been provided to this fundamental problem 0, S S u M, im 



Man y m ethods of adaptive control have been proposed, e.g. extremum 
seeking 13, 14, 15], self tuning regulators 0, [l6[, direct and indirect adapta- 
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tions [17] , and adaptive back-stepping [l8|, [19] . Our motivation is control 
problems that require both transient and steady state performance. There- 
fore, we chose to adapt model reference control (MRC) since it can regulate 
the transients. MRC uses a reference model to specify the ideal response of 
the plant. 

In this paper, we use extremum seeking (ES) loops, based on sinusoidal 
perturbations to optimize cost functions based on the tracking error of MRC. 
We prove global asymptotic tracking of the adaptive system, and develop 
systematic design guidelines to satisfy the conditions of the stability proof. 
This work complements our initial attempt at ES-MRAC 20 . 



ES-MRAC confers advantages over classic MRAC: avoiding perturbation 
of the reference signal, and imposing no requirements on the signs of pa- 
rameters. ES-MRAC is a novel paradigm for adaptive control and opens 
up many interesting theoretical and practical problems, which we list in our 
conclusions. 

Our paper is organized as follows. We supply essential background from 
prior art in section &\ our main result in section [21 and an illustrative example 
in section |H In section we list some of the questions opened up by our 
results. 



2. Essential Prerequisites 

This section introduces some of the preliminary notions, definitions, and 
prior results required for thorough understanding of the paper. Our goal is 
to acquaint the reader with the tools that we use in section |3j 

2.1. Extremum Seeking 

Extremum seeking is a powerful tool for obtaining the extremum (min- 
imum or maximum) value of a map. Hence, it is used in many control 
applications, where the reference-to-output map has an extremum value. 

Suppose we have a system with an input 9, and an output y, which is 
an unknown function of the input, y = f(9). Without loss of generality, we 
assume that the mapping, f(8), has a minimum value. Extremum seeking 
deals with the problem of finding the optimal input, 9*, that would drive the 
output, f{9), to its minimum value, /*. The basic extremum seeking scheme 
is shown in Figure [TJ The perturbation signal sin cut, provides a measure of 
gradient information of the map f(9). The result is summarized as follows. 
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Figure 1: The basic extremum seeking scheme with sinusoidal perturbations. 



Theorem 2.1 (see |l5|] ) . For the system in FigureUi the output error y—f* 
achieves local exponential convergence to an 0(a 2 + l/u 2 ) neighborhood of the 
origin, provided the perturbation frequency u is sufficiently large, and 1+ X L ^ 



is asymptotically stable, where L(s) 



kaf" 
2s ■ 



Our objective in this paper is to develop an adaptive controller that min- 
imizes both the tracking errors and the errors in estimating the unknown 
parameters. Thus, we use extremum seeking as a means of optimizing a non- 
linear cost function of the errors. In simple words, this cost function plays 
the role of the mapping f{9) in Figure [U This process yields an adaptation 
law which updates the gains of the controller. The use of a nonlinear cost 
function, along with the time dependent perturbation signals, lead to a non- 
autonomous, nonlinear set of equations. We use averaging and Lyapunov 
theory for stability analysis. Averaging removes time dependence from the 
equations and makes standard Lyapunov analysis possible. We reproduce 
essential definitions and theorems here for completeness and convenience. 

Definition 2.1 (Averaging, [21]). Consider the non- autonomous system 
x = ef(t,x,e), where e is a small positive constant, and x G D C R n . 
Suppose that f is T -periodic in t, i. e. for all t > 0, fit + T, x, e) = f(t, x, e) . 
We obtain the "average system" by 

i av &fav{%av)i (1) 

where f av (x) = ± / Q T f(r,x,0)dr. 



Theorem 2.2 (see [2lj]). Consider the system 

x — ef(t,x,e), e > 0, (2) 
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where f and its partial derivatives with respect to (x, e) up to the second order 
are continuous and bounded for (t,x,e) G [0, oo) x D x [0, e ], for every 
compact set Dq C D, where D C R n is a domain. Suppose f is T -periodic 
in t for some T > 0. Let x av (et) and x(t,e) denote the solutions of (QP and 
(0|) ; respectively. If x av (et) G D V t G [0, oo) and x(0,e) — x ov (0) = 0(e), 
i/ien t/iere exists e* > stzc/i i/iai /or a// < £ < £*, x(t, e) is defined and 

x(t, e) — x av (et) — 0(e) on [0, oo) (3) 



We assume that the reader is familiar with the standard theorems regard- 
ing global asymptotic stability. Hence, we shall not provide such theorems 
here (see section 4.1 of 2l| for example). 

We are now equipped with the tools required to carry out a rigorous 
analysis on the use of the method of extremum seeking in adaptive control. 
The problem formulation and the main results are described in the following 
section. 



3. Main Results 



In this section, we generalize our results from 22] to a single input multi 
output (SIMO) LTI system of order n. We show that with the proposed 
adaptation procedure, an LTI system of an arbitrary order n with full-state 
measurement, can achieve global asymptotic tracking for all states. 

Suppose that we have a linear system of order n, with control input 
u G R. Without loss of generality, we assume that the governing equations 
are written in the form, 

a n y {n) + an-iZ/ (n-1) + • • • + a y = u, (4) 

where yM g -R, (i = 0, . . . ,n) are measurable states, and a , a 1 , . . . ,a n 
are system parameters. Furthermore, assume that all the parameters are 
unknown to the designer, i.e. = 0, . . . ,n are unknown. Note that the 
usual MRAC requires the sign of a n to be known, whereas this methodology 
puts no restriction on the sign of a n . Hence we can deal with situations in 
which no a priori knowledge on the sign of parameters is given. The objective 
is to design a control law to track the reference model 



yt ] + a m {n-\)vt 1} + • • • + a m0 ym = r(t), (5) 
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where a mn p n + a m ( n _i)P n 1 + . . . + a m0 is Hurwitz. In the above equation, 
r(t) denotes the reference signal, y$ e R, (i — 0, . . . , n) are the states of the 
reference model, and a m o, o m i, . . . , a mn are known constants. 

The method that we propose, provides a control law, u, such that the 
system dynamics, (jl]), will follow the reference model ,fl5J), and uses the 
same tracking error as MRAC. Uncertainties are then, taken into account 
by adaptation of control parameters via Extremum Seeking (ES) loops for 
each parameter. Hence, we call it ES-MRAC. Prior to introducing the sta- 
bility theorem, we provide the following definitions. 

Definition 3.1. The 'auxiliary signal', z(t), is defined as follows 

z(t) 4 y W - fc.-xe*- 1 ) - . . . - Poe (6) 

with the design paramters, 0i, chosen such that the polynomial p n +(3 n -ip n ~ 1 + 
. . . + (3q is Hurwitz, and y m being the reference model as governed by |3p. 

The system parameter estimate is denoted by dj, (i = 0, . . . ,n), and is 
adapted via an extremum seeking block as shown in Figure |2J with a suitable 
choice of a cost function J, a compensator C(s), and a probing frequency Wj. 

Definition 3.2. The 'perturbed estimate' of a parameter is denoted by di, 
(i = 0, . . . ,n), and defined as the estimate of the parameter after being 
perturbed by the sinusoidal signal. In other words, 

tii = ai + Ci sin corf (7) 

Definition 3.3. The 'control input' is defined as 

u 4 a nZ (t) + a n - iy {n - 1] + . . . + a y (8) 

where y is the system state as governed by Op, and z(t) is the auxiliary signal. 



Definition 3.4. The 'tracking error' is defined as the difference between the 
system state and the reference model state, i.e. 

e = y-y m (9) 

The 'tracking error vector' refers to x = [xi, . . . , x n } T — [e, e, . . . , e^ 1 ^ . 
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J(x) 



C(s) 



Ci sm(ujit) sm(u)it - fa) 

Figure 2: ES block for updating parameters. 



Theorem 3.1. For an LTI system of order n given by with the control 
input u, (Ep, and the adaptation law for parameter estimate hi, i — 0, . . . , n, 
as given by the ES block shown in Figure®, let the cost function in Figure® 
be 



t 1 r t i 2 1 



n 
,k=l 



(10) 



where q = [q\, . . . , q n ] T is the vector of weighting factors, and~x. is the tracking 
error vector, and let the compensator in Figure d be 



C(s) = - 9i 



1 + diS 



(11) 



Furthermore, let the probing frequency for each ES loop be given by oji = niU, 
Hi G N, i = 0,...,n with ^ rij for i ^ j. Then, there exist design 
parameters Ci,di,gi,Ui, fy, and qu, (i = 0,...,n, j = 0, ...,n— 1, k = 
1, . . . , n), such that this setup will guarantee global asymptotic convergence of 
the tracking error vector^., to an 0(l/u) neighborhood of the origin, provided 
that the probing frequency, u, is sufficiently large. 

Remark 3.1. In the above theorem, design parameters refer to the following. 

• weighting factors of the cost function q = [g 1; . . . , q n ] T , 

• compensator gains, gi and d{, 

• fa,..., p n -i in Eq. 

• Cj and Ui (probing amplitude and frequency), in Eq. (0). 
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The proof of this theorem is postponed to AppendbfAl However, we present 
the following lemma which helps prove the theorem. The proof of this lemma 
provides insight into how we can choose the design parameters. 

Lemma 3.1. Let x be the tracking error vector, and let the average tracking 
error, x av , be calculated as in Definition ^. 11 If the conditions of Theorem 
\3.1\ are satisfied, then x av — > globally and asymptotically. 

Proof. We start by deriving the governing dynamics, which includes the 
state tracking error dynamics, and the parameter estimation error dynamics. 

Governing Dynamics: 

To find the state tracking error dynamics, we substitute (J7]) and (jSJ) into 
OH]), and the resultant into (j3J. Defining the state tracking error as in flUJ), 
and the parameter convergence error as 

a i = ai-di i = 0, . . . ,n , (12) 

we rearrange the resulting equation to show that the state tracking error 
dynamics is given by 

a n [e< n > + /Vie'"" 1 ) + . . . + /3 e) 

n-l 

= ^(c fc sin u k t - a fc )y (fc) + (c n sin u n t - a n )z(t) 

k=0 

Using Definition I3.4[ we rewrite the state tracking error dynamics in state 
space form as 

x = Ax + — bv T [<S - a] (13) 

a n 

where 
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(14) 
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a = [do, . . . , d n ] T , and S denotes the vector of perturbation signals given by 
S = [c smuot, ... ,c n sin cu n t] T . In addition, the vector v is the regression 
vector given by v = [y, y, . . . , y^ n ~ l \ z] T . 

According to Figure |2l the parameter estimation error is governed by 

di — gi(l + dis)[sxn(u)it - fa) J), i = 0,...,n (15) 

Applying s to the term in brackets, 

— 9i ( sm(cjit — (pi) + diUJi cos(cjjt — fa) j J+gidi sm(ojit — fa) J (16) 

where J is given by ( ITUj) . and i — 0, . . . , n. In order to write the last equation 
in vector form, we define a gain matrix G = diag(#j) G _R( n + 1 ) x ( n+1 ) ; % = 
0, . . . , n + 1, and perturbation vectors, di(i) and d 2 (t) 



di(f) 



sin(wot — fa) + d ujQ cos(a;o^ — fa) 
sin(a;it — (p\) + d\UJi cos(u;i£ — (p\) 



sin(a; n t - <p n ) + d n u n cos(u n t - 
Thus, we write (TT6l) as 



d 2 (t) 



d sm(u t - fa) 
di sin^t — (pi) 



d„ sinful — 



n" fn I 



a = Gd!J + Gd 2 J 



(17) 



Equations (fT3l) and (|T7l) . constitute the governing dynamics. These equations 
are non-autonomous since time appears explicitly in them. 



Averaging: 

Let the greatest common factor of all the probing frequencies, = 
0, . . . ,n, be denoted by u. In other words, Ui = riiUj, where 6 N, % = 
0, . . . ,n. Furthermore, assume that rij ^ rij for i ^ j. This will guarantee 
orthogonality of the sinusoidal perturbations. Let the design parameters be 
chosen such that 

w>l (18) 
0{d i u i ) = 1, i = 0,...,n (19) 
0( 9l ) = l, z = 0,...,n (20) 
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In order to be able to perform averaging, we scale the time as follows. 
Suppose that r = cut, and define e = (1/w) < 1. By substituting r = out 
into f|T3|) . and using Definition 12.11 one can show that the averaged equation 
for tracking error dynamics is given by 



Similarly, performing the same procedure on ( ITTj) . one can show that the 
averaged equation for parameter estimation errors is given by 



where C = diag(gi<ijQ cos 4>i) £ _R( n+1 ) x ( n+1 ) ) % = o, . . . , n. AppendbQH pro- 
vides the details on how averaging can lead us from ( 1T71) to ( 1221 . 

Note that the perturbation amplitude, q, is chosen so as to produce a 
measurable variation in the plant output at the corresponding frequency. 

Lyapunov analysis: 

Now, we are ready to use Lyapunov stability analysis to prove convergence 
of averaged tracking error, x av , to zero. Consider the following Lyapunov 
function 



where P, and T are symmetric positive definite matrices. Without loss of 
generality, we assume that e = 1, and conduct the stability analysis. We shall 
use the prime symbol to denote differentiation with respect to r. Taking the 
derivative of V with respect to r and noting that P and F are symmetric 
matrices, we can write 






(22) 




(23) 




■av 



(24) 



Substituting ([2"Tj) and f l2"2"|) . and simplifying, we get 



V = xf v (A T P + PA)x av + —SlI [g n rCv av q T - v av b T P] x, 



■av 



which we write as 



V = -xf v Qx av + — af v [g n rCv av q T - v av b r P] x ; 



■av 



(25) 
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with Q = — (A T P + PA) a positive definite matrix. By choosing the pa- 
rameters in P, C, q, and T such that 

g„rCv av q r - v av b T P = 0, Vv av G R {n+1) (26) 

we get 

V = -xf v Qx av < 0, (27) 

that is, V is negative semi-definite. This implies that V(r) < V(0), and 
therefore, x av , and a av are bounded. Moreover, v av is also bounded by def- 
inition, since all its components are linear combinations of elements of x av 
and the reference model. Therefore, V" = — 2xf v QAx av + ^-a^ v v av b T Qx av 
is also bounded. Hence V is uniformly continuous. Therefore, by Barbalat's 
lemma, V — > as t — > oo. Hence, according to (|27|) . x av — > as t — > oo, 
i.e. tracking error and all its derivatives converge asymptotically to zero. 
Furthermore, since V — > oo as ||x av || — > oo, global asymptotic tracking is 
achieved. 

The following corollary, which follows from the above proof, provides some 
guidelines as how to choose the design parameters. 

Corollary 3.1 (Design). The design parameters Ci,di,gi,Ui, (3j, and qk, 
(i = 0, . . . , n, j = 0, . . . , n — 1, k = 1, . . . , n) in Theorem \3.1[ must be chosen 
such that the following holds 

1. u > 1 

2. O(di0Ji) = 1, i — 0, . . . , n 

3. 0{gi) = 1, i = 0, . . . ,n 

4. The matrix P = [Py] is found by solving the identity PA + A T P = — Q 
for some positive definite matrix Q ; with A defined as in [IM . 

5. Eigenvalues of the matrix q n TC G _R( n+1 ) x ( ra+1 ) must satisfy 

\ _ \ __ \ _ QlPnl + ?2-Pn2 + • • • + QnPnn / oq n 
Al - A 2 - ■ ■ ■ - \ n+ l) ~ 2 2 T —3 (^ 

</l T 12 T • ■ • T (/„ 

For i/ie special case where Y is diagonal, Y = diag^Q, . . . ,y n ), this 
equation simplifies to 

q n gidiCi cos 0^ = 2— — 2— ■ — , z = 0,...,n (29) 

ql + qi + ... + ql 
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For example, we start by choosing the weighting factors and a positive 
definite matrix Q, and solve for P using condition H] above. Next, we choose 
large enough q's to produce measurable variations in the plant output in the 
presence of noise. Then, for each parameter estimate, the gains are tuned by 
choosing gi, and such that their orders of magnitude satisfy conditions [2] 
and [31 and their values satisfy (1291) for some positive definite diagonal matrix 

r. 

Remark 3.2. The last condition in Corollary \ 3.1\ follows from simplifica- 
tion and mathematical interpretation of l[2o}) as follows. Right multiplying 
(TJ?2p by q and noting that q T q and b T Pq are scalars, we get 

(re - -^-^i (n+1) ) v av = o, vv av g (30) 

which is basically the characteristic equation of the matrix TC, i.e. its eigen- 
values are all identically equal to — b J* 01 . 

Remark 3.3 (Geometrical Interpretation). Equation §2B±) can be inter- 
preted as follows. Consider the vector of weighting factors q, and the vector 
P T b in the hyper space R n . Let the projection of vector P T b onto the vector 
q be denoted by Pi. Since P 1 is in the same direction as q, we can write it 
as a coefficient of q, as follows 

Pi = for some k G R 

This coefficient is given by 

(P T b, q) b T Pq 



K 



(q, q) q T q 



which, according to equals the eigenvalues of the matrix q n TC. There- 

fore, for a chosen diagonal matrix T, an increase in the projection length, 
k, increases the eigenvalues of C, hence an increase in the rate of change of 
parameter estimates (as seen by equation [2^) ). 

Corollary 3.2. If the reference signal r(t) is a unit step, then ES-MRAC 
guarantees that 

(i) Vj G {0, . . . , n) a aV j — > kj, as t — >■ oo for some kj G R, and 
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(a) k = o. 



Proof. In Lemma I3TT| we showed that x av — > globally and asymptotically. 
Substituting x av — > into ( I2"2"j) proves the first part of the corollary. To prove 
the second part, we consider the steady state solution of (121"]) . Substituting 
x av — > into f l2T]) yields the following 



bv a Av -> 



(31) 



Substituting for b, and v av , we get 

... 



o 

2/av 





(n-1) 



Qav,0 

^a,v,n— 1 
O-av^ 







(32) 



Since x av —> 0, therefore, y av — )• y m as t — )• oo. Using final value theorem, 
one can easily show that for a unit step reference signal r(i), lim^oo y m (t) = 
l/a m Q. Thus, fl32|) will simplify to 



fl m0 



(33) 



which is the same as k n = 0. 



4. Second Order Example 

As a special case of higher order systems, consider the equation of motion 
for a linear second order system 

a 2 jj + a x y + a y =u (34) 

with unknown ao, ai and The control objective is to design a state 
feedback control law u, such that the system follows the reference dynamics 
given by 

a m 2y m + a m iy m + a m0 y m = r(t) (35) 

We use theorem 13.11 and Corollary 13.11 to accomplish this. These results 
simplify the design process into a few easily carried out steps. In general, 
we divide the design process into two parts. That is, determination of con- 
trol/adaptation schemes, and gain tuning. 
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4-1. Determination of Control and Adaptation 
Step 1: Control 

Use n = 2 in Theorem 13.11 to provide the control law as follows 



u = a 2 z + aiy + a^y 



(36) 



with the auxiliary signal z(t) = y m — Pie — (3 e and a n = a n + CnSmojnt, 
n = 0,1,2. 

Step 2: Adaptation 

For each of the unknown parameters, do, ai, and a 2 , set up the adaptation 
block as given by Figure [21 using distinct frequencies. For each block, the 
cost function J is given by ffTOl) . 



and the compensator C(s) is given by ( fTTj) . 

Given numerical values, one can already implement the above laws, but 
it is very difficult to design proper gains such that the method performs 
well. Therefore, in what follows, we shall provide some insights into the gain 
tuning process. 

4-2. Gain Tuning 

In general, tuning the gains for optimal performance is very difficult in 
adaptive control, especially in MRAC One advantage of ES-MRAC is that 
it provides some guidelines on gain tuning. In what follows, we shall use the 
results of Corollary 13.11 in a step by step manner, to help us in the process 
of gain tuning. Note that these steps are only provided here as guidelines, 
and one can use a different procedure to tune the gains. Moreover, these 
steps help the designer with a quick estimate of the orders of magnitudes of 
different gains, but will not necessarily provide optimal gains. 



Choose 0o and j5\ such that p 2 + flip + (3o is Hurwitz. These parame- 
ters effect the rate of convergence of tracking error. Then, the A matrix is 
determined from (I14p 



(37) 



Step 1: 



A 



1 

-A) -Pi 



(38) 
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Step 2: 

Choose the coefficients qi and q2 of the cost function J, as defined in ffTO]) . 
StepS: 

Pick a positive definite matrix Q and solve the identity PA + A T P = — Q 
for the 2x2 matrix P. 

Step 4 : 

Choose the probing frequencies of the sinusoidal perturbations uj , Ui, 
and Co>2, such that the sinusoidal terms are orthogonal. Once these values are 
determined, pick the compensator gains do, d±, and g?2 such that 0(diU>i) = 1, 
% = 0,1,2. 

Step 5: 

Finally, decide the values of gi, Ci, <pi, and 7, such that ( 1291) holds. Each 
of these design parameters contribute differently to the control problem. It 
is desirable to make 7j's small, since they have an inverse effect on the rate 
of convergence of parameters (See Remark 13.31) . The values for q's provide 
the amplitude of the perturbation signals. The q's should be large enough to 
produce measureable variations in the plant output, but cannot be too large, 
since they can cause instability, and excitation of higher dynamics which is 
undesirable. Finally, the compensator gains, g^s, need to be large, since they 
can increase the rate of change of parameters, as seen by (|22|) . 

4-3. Numerical Example and Simulations 

We use the following numerical values for our simulations. Let the true 
parameters of the system a2y + a\y + aoy = u, have the values ai — 1, a\ — 3, 
and ao = 6.25. Let the model reference be given by y m + 4.2y m + 9y m = r, 
which corresponds to a system with a natural frequency of u n = 3 rad/sec 
and damping ratio of £ = 0.7. The initial conditions are assumed to be zero 
for the model reference and [y(0),y(0)] T = [— 0.1,0. 2] T for the plant. The 
reference input r is assumed to be a unit step function. 

It is assumed that there is no a priori knowledge of the ideal values of 
parameters. Figures [3] and H] demonstrate the performance of the system 
when the design parameters are chosen as follows: Let f3 — 9, /?i = 3, 
perturbation amplitudes c = 0.3, C\ = c 2 = 0.2 perturbation frequencies 
ujq = 5 rad/sec, U\ — 8 rad/sec, 0J2 = 14 rad/sec damping coefficients d\ = 
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Figure 3: Convergence of tracking error to zero for ES-MRAC in a 2nd order system. 



d 2 = d 3 = 0.1, and gains g\ = 9000, g 2 = 3200, g 3 = 2000. The matrix T is 
chosen to be T = 0.01I 3 . 

We compare our results with those of MRAC applied to the same system. 
Definition of T in chapter 8 of [loj for MRAC, is the inverse of what we have 
defined. Therefore, we use T = diag(90, 90, 50) for MRAC. Although this 
is not the exact inverse of the matrix 0.01I 3 , it produces better results for 
MRAC. 



5. Conclusion 

We proposed a new approach to Model Reference Adaptive Control (MRAC), 
in which adaptation is carried through Extremum Seeking (ES), and we called 
it ES-MRAC. The proposed scheme was presented for a general class of LTI 
systems and proof of global asymptotic tracking was provided. The results of 
this paper open up many questions of practical relevance. Extensions to out- 
put feedback, direct adaptive control, and feedback linearizable systems may 
all be possible. Adaptive back-stepping for nonlinear systems and standard 
gradient and least square based adaptive controls may also have ES-MRAC 
counterparts. 
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Parameter convergence Parameter convergence 
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Figure 4: Convergence of parameters, an, and 0,2 reach their true values, while a\ only 
reaches a constant. 
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A. Proof of Theorem 13.11 



Proof. In order to study the properties of the actual system, we note that 
we can write the governing equations of the system, as x = f(x,t). That is, 

f(x, t), with x 



( TT3"j) and (IT7|) can be written as x 
f(t,x) = 



-TlT 



|x" , a J r , and 



'Ax+ ^bv T [S-a] 

in i _ _ 1 

GdxJ + GdaJ 



(A.l) 



As shown in section [3j we scale the time using r 
e=l/w<l, Thus we get 



out, with u 3> 1, and 



dx 
dr 



ef(r,x) 



(A.2) 



As we can see from ICQ) . /, |f, and g are continuous. Furthermore, we 
note that on any compact set D C D C R 2n+1 , x <E D is bounded. That 
means that x and a are bounded. Therefore, according to ( lA.lj) . f(t,x) 
(similarly f(r,x)) is bounded. Moreover, f(r,x) as defined above is 2n- 
periodic, since we are only using sinusoidal perturbations, and have defined 
u to be the greatest common factor of all probing frequencies (see section |3l 
equation (TT81). 

According to Lemma I3.1[ for the averaged system ( 12"T|) and ( I2"2"j) , x av — > 
globally and asymptotically, and a av is bounded. Therefore, denoting the 
averaged solution for the overall system by x av (er) = [x av , a^] T , we see that 
the averaged system is globally bounded, i.e. x av (er) G D Vr G [0, oo). 

Hence according to Theorem 12.2} solving the average system with the 
same initial state as the original system yields 



• ' | — ) ()!! [0. OC.) 



(A.3) 



where we substituted e = 1/oj. Equation ( 1A.3j) means that the solution to 
the nonautonomous system is only 0(1/ ui) different than the solution of the 
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average system, i.e. x(r) — x av (5r) = 0(l/u)), and a av (r) — a(er) = 0(l/u). 
Since we showed in Lemma 13.11 that x av — )■ globally and asymptotically, 
thus the vector x converges globally and asymptotically to on 0(1/ ui) neigh- 
borhood of the origin. 

B. Details of Averaging the Adaptation Law 

As an example of how averaging is performed, we shall provide details 
of the averaging for the adaptation law. We will show how equation f[2"2l is 
found by averaging (flT|) . 

We start by defining an averaging operator, using Definition 12.11 

Definition B.l. The averaging operator, AVG^.j, for a T -periodic func- 
tion f(x,t,e) is defined as 

AVG (f(x, t, e)) = - J T f(x, r, 0)dr (B.l) 

Remark B.l. The AVG operator is linear 

AVGCd/j + c 2 f 2 ) = Cl AVG(/ a ) + c 2 AVG(/ 2 ), Vc a , c 2 e R 
Next, we substite J — \ (q T x) 2 , and its derivative into ( JT71) 

a = ^Gdi (q T x) 2 + Gd 2 (q T x) (q T x) (B.2) 
Substituting for x from ( fl3l) would then give 

i = iGdx (q T x) 2 + Gd 2 q T xq T (Ax + — bv T [<S - a] ) (B.3) 

Recall that we defined the perturbation frequencies as Ui = UiU, where 
rij e N, % = 0, . . . , n, with a; defined as the greatest common factor of all the 
frequencies. Furthermore, assume that n« ^ rij for % ^ j. Now, if we scale 
( IB.3P using t = ut, we get 
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with 



di(r) 



and 



sin(n r - O ) + ^o^o cos(n r - O ) 
sin(n 1 r — (pi) + diUi cos^r — (pi) 

sin(n n r - (p n ) + d n u n cos(n„r - (p n ) 



, Mr) 



d sin(n r - O ) 
di sin(nir — (pi) 

d n sm(n n r - (p n ) 



si?) 



Co sin uqt 
c\ sinnir 

c n sin n„r 



(B.4) 



Denning e = l/oo, with ou >> 1, enables us to perform the averaging as 
follows 



aLv=(^J =,AVG(^-Gd 1 (r)(q r x)' 



+eAVG I Gd 2 (r)q T xq T 



Ax — — bv T a 



+eAVG [ — Gd 2 (r)q T xq T bv T 5(r) 

On, 



(B.5) 



The first term can be written as 



AVG Qc dl (T) (q T x) 2 )=iG AVG ( dl (r) (q r x) 2 ) 



= -G AVG(d 1 (r))(q r x av ) 2 

=0, 

since di(r) is a zero mean periodic function over T. In a similar fashion, 
one can show that the second term also averages out to zero. Thus we are 
left with 



k' av =sA\G ( — Gd 2 (T)q T xq r bv T S(r 



(B.6) 
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Since v T iS(r) is a scalar, we have v T iS(r) = iS T (r)v. Thus we get 

a^ v =eAVG ( — Gd 2 (r) l S T (r)vq T xq T b^ 

\a n J 

=^ G AVG (d 2 (r)S T (r)) v av q T x av q T b 

a n 

However, 

d 2 (r)S T (r) = 

doCo sin(n r — O ) sin n r • • • doC n sin(n r — O ) sin n n r 

d n c sin(n n r - n ) sin n r ■ ■ • d n c n sin(n n r - n ) sin n n T 
One can easily show that 

1 f T= ^ ■ < ±\ • j / ^3 
— / sminjT — (Bj ) sin n,r dr = < i , . . 

T J \ ±cos0 4 z = j 



(B.7) 



(B.8) 



Thus 



AVG (d 2 (r)5 T (r)) 



co<io cos 0o • • • 
c\d\ cos 0i- • • 







• • -c n d n cos 



(B.9) 



Substituting this into (E7]), and multiplying with the diagonal matrix G 
yields 



2a r , 



g c d cos 0o 

giC\di cos 0i- 












• • -gnCndn COS ( 



v av q T x av q r b 



Since, we have defined C = diag(gj<ijCj cos0j), and since q T b = q n , the last 
equation simplifies to 



a av = e 7y Cv avq X av 



(B.10) 



which is the same as fl22|) . 
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